FairML: ToolBox for Diagnosing Bias in Predictive Modeling

Authors

  • Julius A. Adebayo
  • Lalana Kagal
  • Hal Abelson
Abstract

Predictive models are increasingly deployed to determine access to services such as credit, insurance, and employment. Despite societal gains in efficiency and productivity through deployment of these models, potential systemic flaws have not been fully addressed, particularly the potential for unintentional discrimination. This discrimination could be on the basis of race, gender, religion, sexual orientation, or other characteristics. This thesis addresses the question: how can an analyst determine the relative significance of the inputs to a black-box predictive model in order to assess the model's fairness (or discriminatory extent)? We present FairML, an end-to-end toolbox for auditing predictive models by quantifying the relative significance of the model's inputs. FairML leverages model compression and four input ranking algorithms to quantify a model's relative predictive dependence on its inputs. The relative significance of the inputs to a predictive model can then be used to assess the fairness (or discriminatory extent) of such a model. With FairML, analysts can more easily audit cumbersome predictive models that are difficult to interpret.

Thesis Supervisor: Dr. Lalana Kagal
Title: Principal Research Scientist, CSAIL

Thesis Supervisor: Professor Harold Abelson
Title: Class of 1922 Professor of Computer Science and Engineering

Thesis Supervisor: Professor Alex "Sandy" Pentland
Title: Toshiba Professor of Media Arts and Sciences

Similar resources

A Novel Toolbox for Generating Realistic Biological Cell Geometries for Electromagnetic Microdosimetry

Researchers in bioelectromagnetics often require realistic tissue, cellular and sub-cellular geometry models for their simulations. However, biological shapes are often extremely irregular, while conventional geometrical modeling tools on the market cannot meet the demand for fast and efficient construction of irregular geometries. We have designed a free, user-friendly tool in MATLAB that comb...

An in silico modeling toolbox for rapid prototyping of circuits in a biomolecular "breadboard" system

In this paper, we develop an experimentally validated MATLAB software toolbox as an accompaniment to an in vitro cell-free biomolecular “breadboard” system. The toolbox gives insight into the dynamics of unmeasured states in the cell-free system, accounting especially for the resource usage. Parameter lumping and the reduced order modeling are used to maintain computational tractability and to ...

Cellular S-value of beta emitter radionuclide’s determined using Geant4 Monte Carlo toolbox, comparison to MIRD S-values

Introduction: Spatial dose distribution around the radionuclides sources is required for optimized treatment planning in radioimmunotherapy. At present, the main source of data for cellular dosimetry is the s-values provided by MIRD. However, the MIRD s-values have been calculated based on analytical formula in which no electrons straggling is taken to account. In this study, we used Geant4-DNA...

Providing A Model for Management Earnings Forecast Bias

Despite the important role that management profit forecasting plays in the decision making of capital market actors, these predictions appear to be biased. In the attempt to measure the bias of predicting profit management, numerous one-dimensional measurement tools have been proposed in the accounting and finance literature. Despite these efforts, no comprehensive composite index has been dev...

Development of A New Recurrent Neural Network Toolbox (RNN-Tool)

In this report, we developed a new recurrent neural network toolbox, including the recurrent multilayer perceptron structure and its companying extended Kalman filter based training algorithms: BPTT-GEKF and BPTT-DEKF. Besides, we also constructed programs for designing echo state network with single reservoir, together with the offline linear regression based training algorithm. We name this t...

Publication date: 2017